1. Field of the Invention

This invention relates to computer systems. In particular, the 
invention relates to circuit design techniques and computer-aided design 
(CAD) software tools.

2. Description of Related Art 

In the design of the circuits of a system, the design engineer is typically
faced with a number of decisions based on system requirements and criteria. The
system requirements and criteria usually dictate the specifications of the
circuits in the system in terms of design constraints. The design constraints
limit the choice of design techniques, component selection, design
parameters, etc. The design engineer has to satisfy the design constraints and,
at the same time, optimize the design in terms of other design parameters.
This task becomes increasingly difficult as the complexity of the system
increases. The design engineer has to perform complex trade-off analysis to
optimize the design with respect to a number of design parameters. Examples of
the design parameters include propagation delay and the power consumption of
the circuit.

Propagation delay and power consumption are two important design
parameters. As microprocessor technology is becoming more advanced in
speed and complexity, device power consumption increases significantly.
Even with processor operating voltage reduction, device power consumption
still grows at a rate of several orders of magnitude. This is largely due to an
increased use of on-chip hardware to get parallelism and improve
microprocessor performance. In addition, to get extra performance on certain
critical timing paths, device sizes are being optimized to provide shorter
delays at the circuit level. However, size optimization in a given design is a
very time-consuming process. Often, upsizing transistors to get performance
boosts comes at the expense of a much larger increase in circuit power
consumption.

When a system or a functional block consists of many circuits, it is 
difficult to optimize the design while still meeting the design constraints. 
Currently, there is no known technique to allow the design engineer to 
optimize the overall design in a systematic and efficient manner. The design 
engineer usually works on each circuit separately and performs incremental 
optimization. This process is tedious and does not give the design engineer a 
global picture of the entire design. 

Therefore, there is a need in the technology to provide a simple and
efficient method to optimize the design.

SUMMARY



The present invention is a method and computer program product for 
determining optimal values of design parameters of a subsystem to meet 
design constraints. The subsystem comprises a plurality of circuits. 
Parameter functions are created for the corresponding circuits. The parameter 
functions represent a relationship among the design parameters. The design 
parameters are optimized based on the parameter functions to satisfy the 
design constraints.

BRIEF DESCRIPTION OF THE DRAWINGS

The features and advantages of the present invention will become 
apparent from the following detailed description of the present invention in 
which: 

Figure 1A is a diagram illustrating an engineering design cycle in 
accordance with the teachings of the invention. 

Figure 1B is a diagram illustrating a computer system in which one
embodiment of the present invention may be utilized. 

Figure 2 is a diagram illustrating a design optimization phase according 
to one embodiment of the invention. 

Figure 3 is a diagram illustrating an environment for the power 
modules using tools according to one embodiment of the invention. 

Figure 4 is a diagram illustrating power-delay curves according to one 
embodiment of the invention. 

Figure 5 is a diagram illustrating an example of an arithmetic logic unit 
datapath functional block according to one embodiment of the invention. 

Figure 6A is a diagram illustrating a power-delay curve for the input 
multiplexer shown in Figure 5 according to one embodiment of the 
invention. 

Figure 6B is a diagram illustrating a power-delay curve for the 
comparator shown in Figure 5 according to one embodiment of the 
invention.

Figure 6C is a diagram illustrating a power-delay curve for the static
adder shown in Figure 5 according to one embodiment of the invention. 

Figure 6D is a diagram illustrating a power-delay curve for the output 
multiplexer shown in Figure 5 according to one embodiment of the 
invention. 

Figure 7 is a diagram illustrating a comparison of the power-delay 
curves for the three different implementations of an example circuit according
to one embodiment of the invention. 

Figure 8 is a diagram illustrating a design process according to one 
embodiment of the invention.

DESCRIPTION

The present invention is a method and computer program product for
determining the optimal values of the design parameters of a circuit block.
Parameter functions relating the design parameters for circuits in the circuit
block are created. Based on these parameter functions, the design parameters
are optimized to satisfy the design constraints. In one embodiment, the
design parameters include power and delay and the parameter functions are
power-delay curves. The power-delay curves are generated using a timing
simulator, a power estimator, and transistor sizing tools. The invention
provides a technique to help the designer perform trade-off analysis for
optimizing the design while meeting the design constraints.

In the following description, for purposes of explanation, numerous
details are set forth in order to provide a thorough understanding of the
present invention. However, it will be apparent to one skilled in the art that
these specific details are not required in order to practice the present
invention. In other instances, well-known electrical structures and circuits
are shown in block diagram form in order not to obscure the present
invention.

In a circuit design, the design engineer is typically faced with a number of
design parameters and design constraints. The design constraints are usually
dictated by the system requirements and specifications. Examples of the
design constraints include propagation delay, power consumption, packaging,
number of input/output (I/O) lines, etc. The design constraints are typically
imposed on one or more design parameters, while leaving other parameters
to be optimized to achieve high performance. The design parameters,
therefore, are divided into two parameter sets: a constraint set and an
optimizing set. The "constraint set" includes the constraint parameters, which
are the parameters that have to meet the design constraints. The "optimizing
set" includes the optimizing parameters, which are the parameters that need
to be optimized. In an exemplary scenario, a constraint parameter is the
propagation delay and an optimizing parameter is the power consumption.
In another scenario, the propagation delay is the optimizing parameter and
the power consumption is the constraint parameter.

The relationship between the constraint parameters and the
optimizing parameters is described by a parameter function. A "parameter
function" describes the variation of one parameter as a function of another
parameter. For example, a parameter function may describe the variation of
the power consumption as a function of the delay. The variation of one
parameter as a function of another is typically caused by a configuration of the
circuit such as the size of the transistors, the choice of circuit technology (e.g.,
domino versus static), etc. A configuration of the circuit that gives rise to the
particular values of the design parameters corresponds to a design point.
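
As an illustration only, a parameter function can be held in software as a list of design points. The sketch below is not taken from the described embodiment: the names `DesignPoint` and `lowest_power_within` are hypothetical, and the two sample points simply reuse the approximate values quoted later for the domino curve of Figure 4.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class DesignPoint:
    """One circuit configuration and the parameter values it gives rise to."""
    label: str
    delay_ns: float      # constraint parameter in this example
    current_ma: float    # optimizing parameter in this example


def lowest_power_within(curve: Sequence[DesignPoint],
                        max_delay_ns: float) -> Optional[DesignPoint]:
    """Return the lowest-current design point whose delay meets the limit."""
    feasible = [p for p in curve if p.delay_ns <= max_delay_ns]
    if not feasible:
        return None  # no configuration on this curve meets the constraint
    return min(feasible, key=lambda p: p.current_ma)


# Hypothetical parameter function with two design points.
domino_curve = [
    DesignPoint("A", delay_ns=1.35, current_ma=14.0),
    DesignPoint("B", delay_ns=1.62, current_ma=6.1),
]
```

With a delay limit of 1.5 nsec only point A qualifies; relaxing the limit to 1.7 nsec makes the lower-power point B available.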

A system, a subsystem, a module or a functional block may consist of a
number of circuits. Each circuit is characterized by a parameter function.
Optimizing the design of a subsystem or functional block involves a trade-off
consideration of all the parameter functions of all the individual circuits of
the subsystem or functional block. For a parameter function of a given
circuit, there are many design points corresponding to different circuit
configurations. Therefore, optimizing a subsystem or functional block
involves the selection of the design points on the parameter functions that
provide the optimal values of the optimizing parameters and acceptable
values of the constraint parameters. The present invention provides a
technique to optimize the overall design using the parameter functions.

Figure 1A is a diagram illustrating an engineering design cycle in 
accordance with the teachings of the invention. The engineering design cycle 
100 includes a first logic synthesis phase 110, a circuit design phase 120, a
design optimization phase 130, and a second logic synthesis phase 140. 

The first logic synthesis phase 110 provides the high level logic
description and/or design of the circuits. In the first logic synthesis phase
110, the designer synthesizes the circuits manually or using a number of tools
including Computer-Aided Design (CAD) tools. Examples of CAD tools
include hardware description language (HDL) compilers and schematic entry
tools. The result of the first logic synthesis phase 110 includes the design in
high level form such as a textual description of the circuit at the behavioral
level, register transfer language (RTL), or microarchitecture.

The circuit design phase 120 receives the generated logic synthesis files
to generate the synthesized circuits. The synthesized circuits may be
represented by circuit schematics, a netlist of the circuits, or any other
convenient form that can be further processed by additional CAD tools.
Essentially, the circuit design phase 120 represents an unoptimized complete
design that shows subsystems or functional blocks at the detailed
implementation level.

The design optimization phase 130 determines the optimal values for
the design parameters to meet the design constraints. In the design
optimization phase 130, the design engineer uses a design workstation or a
computer system 132. The computer system 132 is supported by a design
environment which includes the operating system and many CAD tools, such
as a timing analyzer, a power estimator, and a transistor sizing tool, to adjust
the design parameters according to the allowable design budgets. The design
optimization phase 130 typically produces a number of parameter functions
that relate the design parameters for the circuits. An example of such a
parameter function is a power-delay curve 135. The power-delay curve 135
shows the relationship between the power consumption and the propagation
delay for a particular circuit in a functional block. The power-delay curve 135
has a number of design points corresponding to different implementations or
configurations of the circuit under consideration. The power-delay curve 135
provides the design engineer with the basic information to optimize his or her
circuit under the specified design constraints.

As shown in Figure 1A, from the information provided by the power-delay
curve 135, the design engineer modifies the circuit design according to
the design points. The exemplary power-delay curve 135 has three design
points A, B, and C. The design point A corresponds to a circuit
implementation that has high power consumption and fast speed,
representing an undesirable implementation because of excessive power
consumption. The design point B corresponds to the optimal power
consumption and optimal speed, representing the best circuit
implementation. The design point C corresponds to low power consumption
and acceptable speed, representing a desirable implementation. If the circuit
implementation is at the design point A, the design engineer will have the
option to go back to the first logic synthesis phase 110 or the circuit design
phase 120. If the circuit implementation is at the design point C, the design
engineer will go to the second logic synthesis phase 140.



042390.P5512 -9- Patent Application 

WWS/TVN/mns 




The second logic synthesis phase 140 is essentially the same as the first 
logic synthesis phase 110 with the exception that the design engineer now 
focuses more on giving the extra design margin to other circuits in the 
subsystem or functional block. The low power consumption at the design 
point C provides more margin to the power budget for other circuits. In the
second logic synthesis phase 140, the design engineer modifies the circuit 
synthesis based on the extra margin, such as repartitioning, floor-plan editing, 
etc. 

Figure 1B is a diagram illustrating one embodiment of a computer
system 132 in which one embodiment of the present invention may be
utilized. The computer system 132 comprises a processor 150, a host bus 155, a
peripheral bridge 160, a storage device 165, an advanced graphics processor
175, a video monitor 177, and a peripheral bus 180.

The processor 150 represents a central processing unit of any type of
architecture, such as complex instruction set computers (CISC), reduced
instruction set computers (RISC), very long instruction word (VLIW), or
hybrid architecture. The processor 150 is coupled to the peripheral bridge 160
via the host bus 155. While this embodiment is described in relation to a
single processor computer system, the invention could be implemented in a
multi-processor computer system.

The peripheral bridge 160 provides an interface between the host bus
155 and a peripheral bus 180. In one embodiment, the peripheral bus 180 is
the Peripheral Components Interconnect (PCI) bus. The peripheral bridge 160
also provides the graphics port, e.g., Accelerated Graphics Port (AGP), or the
graphics bus 172 for connecting to a graphics controller or advanced graphics
processor 175. The advanced graphics processor 175 is coupled to a video
monitor 177. The video monitor 177 displays graphics and images rendered
or processed by the advanced graphics processor 175. The peripheral bridge
160 also provides an interface to the storage device 165.

The storage device 165 represents one or more mechanisms for storing 
data. For example, the storage device 165 may include non-volatile or 
volatile memories. Examples of these memories include flash memory, read 
only memory (ROM), or random access memory (RAM). Figure 1B also
illustrates that the storage device 165 has stored therein data 167 and program
code 166. The data 167 stores graphics data and temporary data. Program code
166 represents the necessary code for performing any or all of the
techniques in the present invention. Of course, the storage device 165 
preferably contains additional software (not shown), which is not necessary to 
understanding the invention. 

The peripheral bus 180 represents a bus that allows the processor 150 to
communicate with a number of peripheral devices. The peripheral bus 180
provides an interface to a peripheral-to-expansion bridge 185, peripheral
devices 190_1 to 190_N, a mass storage controller 192, a mass storage device
193, and mass storage media 194. The peripheral devices 190_1 to 190_N
represent any device that is interfaced to the peripheral bus 180. Examples of
peripheral devices are fax/modem controller, audio card, network controller,
etc. The mass storage controller 192 provides control functions to the mass
storage device 193. The mass storage device 193 is any device that stores
information in a non-volatile manner. Examples of the mass storage device
193 include hard disk, floppy disk, and compact disk (CD) drive. The mass
storage device 193 receives the mass storage media 194 and reads their
contents to configure the design environment for the design engineer.

The mass storage media 194 contain programs or software packages
used in the environment. The mass storage media 194 represent a computer
program product having program code or code segments that are readable by
the processor 150. A program code or a code segment includes a program, a
routine, a function, a subroutine, or a software module that is written in any
computer language (e.g., high level language, assembly language, machine
language) that can be read, processed, compiled, assembled, edited,
downloaded, transferred, or executed by the processor 150. The mass storage
media 194 include any convenient media such as floppy diskettes, compact
disk read only memory (CDROM), digital audio tape (DAT), optical laser disc,
or communication media (e.g., internet, radio frequency link, fiber optics
link). For illustrative purposes, Figure 1B shows floppy diskettes 195 and
compact disk read only memory (CDROM) 196. The floppy diskettes 195 and/or
CDROM 196 contain design environment 198. Examples of the tools or
computer readable program code in the design environment 198 include
operating system, computer aided design (CAD) tools such as schematic
capture, hardware description language (HDL) compiler, text editors, netlist
generator, timing analyzer, power vector generator, timing simulator, power
simulator, circuit configuration, component sizer, parameter function
generator, parameter optimizer, and graphics design environment. These
tools, together with the operating system of the computer system 132, form the
design environment 198 on which the design and optimization process can be
carried out.






The peripheral-to-expansion bridge 185 represents an interface device
between the peripheral bus 180 and an expansion bus 187. The expansion bus
187 represents a bus that interfaces to a number of expansion devices 188_1 to
188_K. Examples of an expansion device include a parallel input/output (I/O)
device and a serial communication interface device. In one embodiment, the
expansion bus 187 is an Industry Standard Architecture (ISA) or Extended
Industry Standard Architecture (EISA) bus.

The computer system 132 can be used in all or part of the phases of the
design process. The processor 150 executes instructions in the program code
166 to access data 167 and interact with the design environment 198. In
particular, the computer system 132 is used in the design optimization phase
130.

Figure 2 is a diagram illustrating a design optimization phase according
to one embodiment of the invention. The design optimization phase 130
includes a netlist generation module 210, a critical path generation module
223, a power vector generation module 227, a delay calculation module 233, a
power calculation module 237, a circuit configuration module 240, a
parameter function generation module 250, and an optimization module 260.
Each of these modules may be a software module or a hardware module or a
combination of both. In one embodiment, these modules are implemented
by program code that is readable and executed by the processor 150.

The netlist generation module 210 generates the circuit netlist which
provides the information on component identification and how the
components of the circuit are interconnected. The circuit netlist becomes the
input to the critical path generation module 223 and the power vector
generation module 227. The critical path generation module 223 generates
timing delays of various paths in the circuit based on circuit components and
interconnection patterns. From these timing delays, the critical path or paths
are identified. The critical path represents the path through which the overall
propagation delay is the most critical, for example, a path on which timing
parameters (e.g., setup time, hold time) are difficult to satisfy. The timing
files generated by the critical path generation module 223 become the input to
the delay calculation module 233. The delay calculation module 233 calculates
the delays of the critical paths and other paths using a timing simulator. In
one embodiment, the timing simulator is the PathMill tool, developed by Epic
Technologies, now owned by Synopsys, of Mountain View, California. The
timing values are then forwarded to the circuit configuration module 240. On
the power side, the power vector generation module 227 generates power
vectors as input to the power calculation module 237. The power calculation
module 237 calculates the power consumption of the circuit using a power
estimator tool. In one embodiment, the power estimator tool is the PowerMill
tool, developed by Epic Technologies of Mountain View, California. The power
values are then forwarded to the circuit configuration module 240.

The circuit configuration module 240 configures the circuit to
adjust the power consumption and delay. One configuration is scaling the
sizes (e.g., transistor size) of the circuit components using a sizing tool. In one
embodiment, the sizing tool is Amps, developed by Epic Technologies of
Mountain View, California. The sizing tool applies scale factors to scale down
the circuit elements either globally or locally. The resulting circuit is then
simulated again for the next delay and power values. The circuit
configuration module 240 generates new circuit information to be fed back to
the delay calculation module 233 and the power calculation module 237. The
process continues until all the values within the range of the scaling have
been used. Then the delay and power values are forwarded to the parameter
function generation module 250. The parameter function generation module
250 generates the parameter function (e.g., power-delay curves) showing the
relationship between the design parameters. The parameter function
generation module 250 may also generate the design parameters in any other
convenient forms for later processing.
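
The feedback loop described above can be sketched as a simple sweep over scale factors. This is an illustration under stated assumptions, not the described tool flow: `apply_scale`, `measure_delay`, and `measure_power` are hypothetical callables standing in for the sizing tool, the timing simulator, and the power estimator, and the toy models at the end are invented purely so the sketch can run.

```python
from typing import Callable, Dict, Iterable, List, Tuple

Circuit = Dict[str, float]  # hypothetical stand-in for a sized netlist


def sweep_power_delay(
    scale_factors: Iterable[float],
    apply_scale: Callable[[float], Circuit],
    measure_delay: Callable[[Circuit], float],
    measure_power: Callable[[Circuit], float],
) -> List[Tuple[float, float, float]]:
    """Return (scale, delay, power) design points, ordered by delay."""
    points = []
    for scale in scale_factors:
        circuit = apply_scale(scale)      # sizing step
        delay = measure_delay(circuit)    # timing simulation step
        power = measure_power(circuit)    # power estimation step
        points.append((scale, delay, power))
    # Sorting by delay gives the points in the order they appear on a curve.
    return sorted(points, key=lambda p: p[1])


# Toy models (assumptions): smaller devices are slower but draw less current.
def toy_apply_scale(scale: float) -> Circuit:
    return {"scale": scale}


def toy_delay(circuit: Circuit) -> float:
    return 0.5 + 1.0 / circuit["scale"]


def toy_power(circuit: Circuit) -> float:
    return 4.0 * circuit["scale"]


curve = sweep_power_delay([1.0, 0.8, 0.6], toy_apply_scale, toy_delay, toy_power)
```

Each tuple in `curve` is one design point; plotting power against delay for these tuples would give a power-delay curve of the kind produced by the parameter function generation module 250.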

The optimization module 260 receives the values of the design 
parameters either in the form of a parameter curve, or in any other 
convenient format. The optimization module 260 determines the optimal
values of the design parameters. 

Figure 3 is a diagram illustrating an environment for the power 
modules using tools according to one embodiment of the invention. The 
environment 300 includes a format converter module 310, the power vector 
generation module 227, and the power calculation module 237.

The format converter module 310 receives a default file 312 and Intel
SPice (ISP) files 314, and generates a netlist file (.ntl) 318. In one embodiment,
the format converter module 310 uses a program called ISPECE2 to map
interconnect models and assign transistor sizes for transistors. The ISPECE2
program converts a netlist file into another netlist file which has a format
compatible with the program used in the power calculation module 237. The
default file 312 is used by the ISPECE2 program and provides some default
information based on the current circuit technology. The ISP files are circuit
description files or netlist files that describe the transistor sizes, cell names,
and circuit connectivity for the designs. The .ntl file 318 is a file containing
the circuit description and/or netlist file in a format compatible with the
power calculation module 237 (e.g., the Epic format which is a format
compatible with tools by Epic Technologies).

The power vector generation module 227 receives a command text
(.tcmd) file 322 and generates a vector (.vec) file 328. In one embodiment, the
power vector generation module uses an Intel Vector Generation (iVGEN)
program. The .tcmd file is a small text file which has a list of input pins for a
given circuit. The program iVGEN uses the .tcmd file to generate vectors
corresponding to the pins listed in the .tcmd file. A script program takes the
circuit's ISP file 314 and generates the .tcmd file. The .vec file 328 is the file
containing the vectors needed to run the power calculation module 237. The
.vec file basically contains the time steps and a list of binary values (0's and
1's) for each input in the circuit.

The power calculation module 237 receives the .ntl file 318, the .vec file
328, a circuit technology (.tech) file 332, a configuration (.cfg) file 334, and a
capacitance (.cap) file 336, and generates a log (.log) file 342 and an error (.err)
file 344. In one embodiment, the power calculation module 237 uses a power
estimator tool, e.g., PowerMill. The .tech file is a circuit technology
dependent file that lists the process parameters for a given circuit technology.
This file is used by the PowerMill tool to calculate the current consumption
of each transistor in the circuit. The .cfg file 334 is a command file for
PowerMill. The .cfg file 334 is a user generated text file used to instruct
PowerMill what kind of outputs to generate. The command line syntax is
according to the PowerMill format. The .cap file 336 is a file containing
parasitic capacitance from the layout if the layout exists. If the circuit is
designed before the layout, there is no .cap file 336 and PowerMill
estimates the parasitic capacitance. The .log file 342 contains the output of the
power calculation module 237. The .log file 342 contains a list of average and
peak current consumption for the entire simulated circuit. The .err file 344 is
a file containing errors that may occur during the simulation. Examples of the
errors include incomplete input vectors or syntax problems in the input files.

Figure 4 is a diagram illustrating power-delay curves according to one
embodiment of the invention. The diagram shows two curves: a
domino curve 410 and a static curve 420.

The power-delay curves in Figure 4 show the parameter function for 
an arithmetic circuit. The arithmetic circuit can be designed using a domino 
circuit technology or a static circuit technology. The domino curve 410 is the 
power-delay curve for the circuit using the domino circuit technology and the 
static curve 420 is the power-delay curve for the circuit using the static circuit 
technology. 

The domino curve 410 has two design points A and B. The design 
point A corresponds to the current domino design. At this design point, the 
circuit has a delay of approximately 1.35 nsec and a power consumption of 
approximately 14 mA. The design point B corresponds to another domino 
design with longer delay at approximately 1.62 nsec and a power consumption 
of approximately 6.1 mA. Therefore the saving in power to go from design 
point A to design point B is 53% for a delay penalty of 23%. 

The static curve 420 has a design point C. The static curve 420 has a 
delay limit at approximately 1.42 nsec. The design point C is at a delay of 
approximately 1.62 nsec and a power consumption of approximately 4.5 mA.
Therefore, the design point C has approximately the same delay as the design
point B of the domino curve 410 but has an additional power saving of 16%. 

The power-delay curves therefore provide the design engineer with an
immediate visualization of the relationship between the design parameters,
e.g., power and delay, so that optimization can be carried out.

Figure 5 is a diagram illustrating an example of an arithmetic logic unit 
(ALU) datapath subsystem or functional block (FB) according to one 
embodiment of the invention. The ALU datapath FB 500 includes an input 
multiplexer (MUX) 510, a comparator 520, a static adder 530, and an output 
MUX 540. The ALU datapath FB 500 is a common design used in the 
processor 150 or the advanced graphics processor 175 in Figure 1B.

In this illustrative example, the design parameters include power and 
delay. The parameter function is the power-delay curve. The constraint 
parameter is the propagation delay through the ALU FB 500 and the 
optimizing parameter is the power. The optimization is to minimize the 
overall power consumption while keeping the propagation delay within the 
specified design constraint. 

The input MUX 510, the comparator 520, the static adder 530 and the
output MUX 540 form a cascaded chain of circuit elements which has a critical
path going from one end to the other end. The composite delay is the sum of
the individual delays through each of the circuit elements. In addition, it is
assumed that these circuit elements are all active, so that the power
consumption of the ALU FB 500 is the sum of the individual power
consumptions.

The design optimization phase includes the generation of the power-delay
curves for the individual circuit elements. Then a trade-off analysis is
performed on these power-delay curves. Each power-delay curve has several
design points. Each design point corresponds to a design configuration of the
circuit. In one embodiment, the design configuration is the transistor size
characterized by a scale factor. The design optimization phase begins with a
set of initial design points on the power-delay curves. These initial design
points correspond to a composite delay that meets the specified timing
constraint. The optimization process then proceeds to iteratively determine
the new set of design points on the power-delay curves such that the specified
timing constraint remains met while the total power is reduced.
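
One way to picture this selection is as a search over combinations of design points, one per circuit element. The sketch below is an assumption-laden illustration: it uses an exhaustive search in place of the iterative procedure described above, the function name `select_design_points` is hypothetical, and the two example curves are invented values, not data from the figures.

```python
from itertools import product
from typing import Dict, List, Optional, Tuple

Point = Tuple[str, float, float]  # (label, delay in nsec, current in mA)


def select_design_points(
    curves: Dict[str, List[Point]],
    max_delay: float,
    tol: float = 1e-9,
) -> Optional[Dict[str, Point]]:
    """Pick one point per circuit minimizing total current within max_delay."""
    names = list(curves)
    best: Optional[Dict[str, Point]] = None
    best_current = float("inf")
    # Try every combination of one design point per circuit element.
    for combo in product(*(curves[name] for name in names)):
        total_delay = sum(point[1] for point in combo)      # delays add in a chain
        total_current = sum(point[2] for point in combo)    # all elements active
        # The tolerance guards against floating-point rounding at the limit.
        if total_delay <= max_delay + tol and total_current < best_current:
            best_current = total_current
            best = dict(zip(names, combo))
    return best


# Invented example curves for two cascaded circuit elements X and Y.
example_curves = {
    "X": [("X1", 1.0, 5.0), ("X2", 1.2, 3.0)],
    "Y": [("Y1", 1.8, 6.0), ("Y2", 2.0, 4.0), ("Y3", 2.2, 3.5)],
}
choice = select_design_points(example_curves, max_delay=3.2)
```

Exhaustive search is practical only for a handful of circuits with a few design points each, since the number of combinations grows multiplicatively; it is used here because it makes the constraint and the objective easy to see.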

Figure 6A is a diagram illustrating a power-delay curve 610A for the
input multiplexer shown in Figure 5 according to one embodiment of the
invention. The power-delay curve 610A has two design points, A and B. The
design point A has a delay value of 0.25 nsec and a power value of 3.2 mA.
The design point B has a delay value of 0.28 nsec and a power value of 1.6
mA. A and B are the initial and new design points, respectively. The arrow
shows the move from design point A to design point B during the design
optimization phase.

Figure 6B is a diagram illustrating a power-delay curve 610B for the
comparator shown in Figure 5 according to one embodiment of the
invention. The power-delay curve 610B has two design points, C and D. The
design point C has a delay value of 1.12 nsec and a power value of 1.0 mA.
The design point D has a delay value of 1.05 nsec and a power value of 1.9
mA. C and D are the initial and new design points, respectively. The arrow
shows the move from design point C to design point D during the design
optimization phase.

Figure 6C is a diagram illustrating a power-delay curve 610C for the
static adder shown in Figure 5 according to one embodiment of the
invention. The power-delay curve 610C has two design points, E and F. The
design point E has a delay value of 1.23 nsec and a power value of 10.0 mA.
The design point F has a delay value of 1.37 nsec and a power value of 4.0
mA. E and F are the initial and new design points, respectively. The arrow
shows the move from design point E to design point F during the design
optimization phase.

Figure 6D is a diagram illustrating a power-delay curve 610D for the output 
multiplexer shown in Figure 5 according to one embodiment of the 
invention. The power-delay curve 610D has two design points, G and H. The 
design point G has a delay value of 1.75 nsec and a power value of 4.0 mA. 
The design point H has a delay value of 1.65 nsec and a power value of 6.0 
mA. G and H are the initial and new design points, respectively. The arrow 
shows the move from design point G to design point H during the design 
optimization phase. 

The power and delay parameters obtained from the power-delay curves 
610A, 610B, 610C, and 610D have the following values: 

Initial design points: 

Total delay: 0.25 + 1.12 + 1.23 + 1.75 = 4.35 nsec 

Total current: 3.2 + 1.0 + 10.0 + 4.0 = 18.2 mA 

New design points: 

Total delay: 0.28 + 1.05 + 1.37 + 1.65 = 4.35 nsec 

Total current: 1.6 + 1.9 + 4.0 + 6.0 = 13.5 mA 

Therefore, it is seen that the new design points B, D, F, and H result in the 
same composite delay of 4.35 nsec, but with a 25.8% saving in power (the 
total current falls from 18.2 mA to 13.5 mA). 
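
The totals and the percentage saving quoted above can be checked with a short script. The following Python sketch is illustrative only and is not part of the described embodiment. The names DESIGN_POINTS and totals are hypothetical, and the values are restated in integer picoseconds and microamperes, an assumption made here so that the sums are exact.

```python
# Illustrative check of the totals quoted in the text (not the described tool).
# Units: delay in picoseconds, current in microamperes, so sums are exact integers.
# Each block maps to ((initial delay, initial current), (new delay, new current)).
DESIGN_POINTS = {
    "input multiplexer":  ((250, 3200), (280, 1600)),     # points A, B (Figure 6A)
    "comparator":         ((1120, 1000), (1050, 1900)),   # points C, D (Figure 6B)
    "static adder":       ((1230, 10000), (1370, 4000)),  # points E, F (Figure 6C)
    "output multiplexer": ((1750, 4000), (1650, 6000)),   # points G, H (Figure 6D)
}


def totals(index):
    """Sum delay and current over all blocks; index 0 = initial, 1 = new."""
    delay = sum(points[index][0] for points in DESIGN_POINTS.values())
    current = sum(points[index][1] for points in DESIGN_POINTS.values())
    return delay, current


initial_delay, initial_current = totals(0)
new_delay, new_current = totals(1)
saving_percent = 100.0 * (initial_current - new_current) / initial_current

print(f"initial: {initial_delay / 1000:.2f} nsec, {initial_current / 1000:.1f} mA")
print(f"new:     {new_delay / 1000:.2f} nsec, {new_current / 1000:.1f} mA")
print(f"saving:  {saving_percent:.1f}%")
```

The composite delay is modeled here as a plain sum of the block delays, which mirrors the addition performed in the text.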

The power-delay curves in Figures 6A, 6B, 6C, and 6D illustrate the 
optimization process by varying the variable design parameter and selecting 
the best overall values. The variable design parameter is common to all the 
curves. In this example, the variable design parameter is the transistor size, 
or the power of the block. 

The optimization process can be applied for different circuit 
configurations. For example, a circuit block can be designed using a static 
circuit technology or a dynamic (e.g., domino) circuit technology as illustrated 
in Figure 4. In another example, a circuit block may be designed using a 
multiplexer or a decoder. In these cases, the optimization process can be 
carried out based on the parameter function, e.g., power-delay curve. 

Figure 7 is a diagram illustrating a comparison of the power-delay 
curves for three different implementations of an example circuit according 
to one embodiment of the invention. The power-delay curves 710, 720, and 
730 correspond to the initial, better, and worse designs, respectively. 

The power-delay curve 710 has high power consumption but fast speed. 
The power-delay curve 720 has a wider delay range and reasonable power 
consumption. The power-delay curve 730 is similar to 720 but the delay 
covers a slower range. 

Suppose the design constraint is a delay of approximately 1.5 nsec. 
Under this timing constraint, it is seen that the design depicted by the 
power-delay curve 730 is not acceptable. Both designs depicted by the 
power-delay curves 710 and 720 are acceptable because they cover the 
specified timing constraint. However, the power-delay curve 720 shows a 
better design because at 1.5 nsec, it results in a 50% power reduction 
relative to the design of curve 710. 
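
The selection described above (discard curves that do not cover the target delay, then take the lowest power among the rest) can be sketched in Python. This is an illustrative sketch only. The text gives no numeric values for curves 710, 720, and 730, so the sample points below are hypothetical placeholders chosen merely to reproduce the qualitative behavior described; the names CURVES, power_at, and best_design are likewise assumptions.

```python
# Illustrative sketch: choose among alternative implementations of one circuit
# at a target delay. All sample points are HYPOTHETICAL placeholders; the text
# does not give numeric values for curves 710, 720, and 730.
CURVES = {
    "710": [(1.0, 12.0), (1.5, 8.0), (1.6, 7.5)],  # (delay nsec, power mA): fast, high power
    "720": [(1.2, 6.0), (1.5, 4.0), (2.2, 2.0)],   # wider delay range
    "730": [(1.8, 5.0), (2.4, 3.0), (3.0, 2.0)],   # slower delay range
}


def power_at(curve, delay):
    """Linearly interpolate power at `delay`; return None if the curve does not cover it."""
    points = sorted(curve)
    if not points[0][0] <= delay <= points[-1][0]:
        return None
    for (d0, p0), (d1, p1) in zip(points, points[1:]):
        if d0 <= delay <= d1:
            return p0 + (p1 - p0) * (delay - d0) / (d1 - d0)
    return points[0][1]  # single-point curve whose delay equals the target


def best_design(curves, target_delay):
    """Return (name, power) of the lowest-power curve covering target_delay, or None."""
    feasible = {}
    for name, curve in curves.items():
        power = power_at(curve, target_delay)
        if power is not None:
            feasible[name] = power
    if not feasible:
        return None
    name = min(feasible, key=feasible.get)
    return name, feasible[name]


print(best_design(CURVES, 1.5))
```

With these placeholder points, curve 730 is rejected because it does not reach 1.5 nsec, and curve 720 is selected at half the power of curve 710.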

Figure 8 is a diagram illustrating a design process 800 according to one 
embodiment of the invention. The design process 800 includes an initial 
phase 810, an optimization phase 820, and a final phase 830. 

In the initial phase 810, the design engineer generates the initial design 
parameter files for all circuits in the circuit block. The initial design 
parameters are selected according to some predetermined criteria. In most 
cases, they are selected based on the experience of the design engineer. The 
selected design parameters may or may not meet the design constraints. 

In the optimization phase 820, the design engineer optimizes the 
optimizing parameters according to the design constraints imposed on the 
constraint parameters. As part of the optimization phase 820, parameter 
functions are generated to facilitate the trade-off analysis. In one 
embodiment, the parameter functions are shown as parameter data files 
which contain values of the parameters at various design points. The 
optimization process can be done manually, automatically, or 
semiautomatically. In manual mode, the design engineer examines the 
parameter function and adjusts the design points in an iterative manner to 
improve the optimizing parameter(s) while keeping the constraint 
parameters within the specified range. In automatic mode, an 
optimization program reads the parameter data files and processes the data 
according to an optimization algorithm. Numerous optimization algorithms 
for numerical data exist. Examples include traditional search algorithms, 
branch-and-bound, scheduling techniques, and genetic algorithms. In 
semiautomatic mode, part of the optimization is done manually and part 
is done automatically. The optimization is usually done iteratively. The 
optimizing parameters are selected to provide overall optimal values while 
keeping the constraint parameters within the specified design constraints. 
The iterative process is terminated when the optimal values are within a 
predetermined range. 
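
As one possible illustration of the automatic mode, the following Python sketch applies an exhaustive search, which is only one of the traditional search algorithms mentioned above and is not asserted to be the algorithm of any particular embodiment. It uses the design points of Figures 6A through 6D as in-memory data rather than reading parameter data files. The names CANDIDATES and optimize are hypothetical, and the values are restated in integer picoseconds and microamperes so that the constraint comparison is exact.

```python
from itertools import product

# Illustrative sketch of the automatic mode using exhaustive search.
# Candidate design points per block, as (delay in psec, current in uA);
# values come from Figures 6A-6D, converted to integers so sums are exact.
CANDIDATES = {
    "input multiplexer":  {"A": (250, 3200), "B": (280, 1600)},
    "comparator":         {"C": (1120, 1000), "D": (1050, 1900)},
    "static adder":       {"E": (1230, 10000), "F": (1370, 4000)},
    "output multiplexer": {"G": (1750, 4000), "H": (1650, 6000)},
}


def optimize(candidates, max_delay):
    """Return (labels, total delay, total current) minimizing total current
    subject to total delay <= max_delay, or None if nothing is feasible."""
    best = None
    per_block = [list(points.items()) for points in candidates.values()]
    for combo in product(*per_block):  # one design point per block
        delay = sum(point[0] for _, point in combo)
        current = sum(point[1] for _, point in combo)
        if delay > max_delay:
            continue  # violates the timing constraint
        if best is None or current < best[2]:
            best = (tuple(label for label, _ in combo), delay, current)
    return best


result = optimize(CANDIDATES, max_delay=4350)
print(result)
```

With two candidate points per block the search visits only 16 combinations and selects points B, D, F, and H. For curves with many design points the number of combinations grows quickly, which is where branch-and-bound or the other algorithms named above would likely be preferred.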

The final phase 830 generates the new design parameter files from the 
values determined by the optimization phase 820. The new design 
parameters are used for all the circuits in the subsystem or functional block. 

The product as a result of the above three phases may include a design 
environment, a computer program product, a software library, a script file, a 
program, or a function stored in any computer readable media. The elements 
of the product include a code or program segment to create the parameter 
functions for the circuits, a code or program segment to optimize the design 
parameters based on the generated parameter functions, a code or program 
segment to configure each circuit in the subsystem or functional block, 
a code or program segment to generate the values of the design 
parameters to be used in creating the parameter functions, a code or program 
segment to select the values of the constraint parameters to meet the design 
constraints, a code or program segment to determine the values of the 
optimizing parameters, a code or program segment to iterate the 
determination and selection of the values of the parameters, a code or 
program segment to size the circuit components, a code or program segment 
to select the circuit technology (e.g., domino versus static), and a code or 
program segment to perform CAD functions (e.g., netlist generation, timing 
analysis, power vector generation, timing and power calculations). As is 
known by one skilled in the art, these code or program segments may link, 
call, or invoke other programs or modules including the CAD tools. 

The present invention therefore is a technique to optimize the design 
of a subsystem or functional block having a number of circuits. The 
subsystem or functional block has a set of design parameters which are 
divided into two groups: optimizing parameters and constraint parameters. 
The technique includes the generation of parameter functions or data files 
which show the relationship between the design parameters. An 
optimization process is then carried out to select the optimal values for the 
optimizing parameters while keeping the constraint parameters within 
the specified range. The technique provides the design engineer with a global 
picture of the overall design so that global optimization can be performed. 



While this invention has been described with reference to illustrative 
embodiments, this description is not intended to be construed in a limiting 
sense. Various modifications of the illustrative embodiments, as well as 
other embodiments of the invention, which are apparent to persons skilled 
in the art to which the invention pertains are deemed to lie within the spirit 
and scope of the invention. 